Goto

Collaborating Authors

 Tabasco


ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

arXiv.org Artificial Intelligence

Large Language Models (LLMs) have shown remarkable capabilities in reasoning, exemplified by the success of OpenAI-o1 and DeepSeek-R1. However, integrating reasoning with external search processes remains challenging, especially for complex multi-hop questions requiring multiple retrieval steps. We propose ReSearch, a novel framework that trains LLMs to Reason with Search via reinforcement learning without using any supervised data on reasoning steps. Our approach treats search operations as integral components of the reasoning chain, where when and how to perform searches is guided by text-based thinking, and search results subsequently influence further reasoning. We train ReSearch on Qwen2.5-7B(-Instruct) and Qwen2.5-32B(-Instruct) models and conduct extensive experiments. Despite being trained on only one dataset, our models demonstrate strong generalizability across various benchmarks. Analysis reveals that ReSearch naturally elicits advanced reasoning capabilities such as reflection and self-correction during the reinforcement learning process.


Hallucination Diversity-Aware Active Learning for Text Summarization

arXiv.org Artificial Intelligence

Large Language Models (LLMs) have shown propensity to generate hallucinated outputs, i.e., texts that are factually incorrect or unsupported. Existing methods for alleviating hallucinations typically require costly human annotations to identify and correct hallucinations in LLM outputs. Moreover, most of these methods focus on a specific type of hallucination, e.g., entity or token errors, which limits their effectiveness in addressing various types of hallucinations exhibited in LLM outputs. To our best knowledge, in this paper we propose the first active learning framework to alleviate LLM hallucinations, reducing costly human annotations of hallucination needed. By measuring fine-grained hallucinations from errors in semantic frame, discourse and content verifiability in text summarization, we propose HAllucination Diversity-Aware Sampling (HADAS) to select diverse hallucinations for annotations in active learning for LLM finetuning. Extensive experiments on three datasets and different backbone models demonstrate advantages of our method in effectively and efficiently mitigating LLM hallucinations.


Towards Modeling Learner Performance with Large Language Models

arXiv.org Artificial Intelligence

Recent work exploring the capabilities of pre-trained large language models (LLMs) has demonstrated their ability to act as general pattern machines by completing complex token sequences representing a wide array of tasks, including time-series prediction and robot control. This paper investigates whether the pattern recognition and sequence modeling capabilities of LLMs can be extended to the domain of knowledge tracing, a critical component in the development of intelligent tutoring systems (ITSs) that tailor educational experiences by predicting learner performance over time. In an empirical evaluation across multiple real-world datasets, we compare two approaches to using LLMs for this task, zero-shot prompting and model fine-tuning, with existing, non-LLM approaches to knowledge tracing. While LLM-based approaches do not achieve state-of-the-art performance, fine-tuned LLMs surpass the performance of naive baseline models and perform on par with standard Bayesian Knowledge Tracing approaches across multiple metrics. These findings suggest that the pattern recognition capabilities of LLMs can be used to model complex learning trajectories, opening a novel avenue for applying LLMs to educational contexts. The paper concludes with a discussion of the implications of these findings for future research, suggesting that further refinements and a deeper understanding of LLMs' predictive mechanisms could lead to enhanced performance in knowledge tracing tasks.


INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation

arXiv.org Artificial Intelligence

We introduce a method that dramatically reduces fine-tuning VRAM requirements and rectifies quantization errors in quantized Large Language Models. First, we develop an extremely memory-efficient fine-tuning (EMEF) method for quantized models using Low-Rank Adaptation (LoRA), and drawing upon it, we construct an error-correcting algorithm designed to minimize errors induced by the quantization process. Our method reduces the memory requirements by up to 5.6 times, which enables fine-tuning a 7 billion parameter Large Language Model (LLM) on consumer laptops. At the same time, we propose a Low-Rank Error Correction (LREC) method that exploits the added LoRA layers to ameliorate the gap between the quantized model and its float point counterpart. Our error correction framework leads to a fully functional INT2 quantized LLM with the capacity to generate coherent English text. To the best of our knowledge, this is the first INT2 Large Language Model that has been able to reach such a performance. The overhead of our method is merely a 1.05 times increase in model size, which translates to an effective precision of INT2.1. Also, our method readily generalizes to other quantization standards, such as INT3, INT4, and INT8, restoring their lost performance, which marks a significant milestone in the field of model quantization. The strategies delineated in this paper hold promising implications for the future development and optimization of quantized models, marking a pivotal shift in the landscape of low-resource machine learning computations.


Hundreds of ancient ceremonial sites are found hidden in Mexico

Daily Mail - Science & tech

Hundreds of newly-discovered ancient ceremonial sites in Mexico reveal how the Mayans adopted a mysterious design trait from the older Olmec civilization more than 3,000 years ago, a study shows. Researchers have revealed that there are 478 ceremonial complexes that can't be seen with the human eye in modern-day southern Mexico, but can be detected with lidar scanning technology. The hundreds of ceremonial complexes are a combination of Maya and older Olmec sites, according to the study authors. Originating around 2600 BC, the Maya civilization thrived in Central America for nearly 3,000 years, reaching its height between AD 250 to 900. The Olmecs, meanwhile, were another Mesoamerican civilization who occupied the land earlier, from around 2,500 to 400 BC.


27 Maya ritual sites discovered on online map by eagle-eyed archaeologist

FOX News

Researchers have uncovered a 1,500-year-old stucco mask of Maya ruler K'inich Janaab'Pakal. What differentiates this mask from others is it's seemingly made in the king's likeness. An eagle-eyed archaeologist has used a freely available online map to locate 27 Maya ceremonial sites in Mexico. Takeshi Inomata, a professor of archaeology at the University of Arizona, made the discovery using a LiDAR (Light Detection and Ranging) map he found online last year, according to the New York Times. LiDAR technology harnesses a laser to measure distances to the Earth's surface and can prove extremely valuable to study what is hidden in areas with thick vegetation.


Pompeo accuses Iran of 'unprecedented attack' after drones hit Saudi oil facilities

FOX News

The attack comes after Iran exceeded their enriched uranium stockpile limit in the nuclear deal. Secretary of State Mike Pompeo called on the international community to join him Saturday in condemning Iran for drone attacks on two Saudi oil facilities, which he described as "an unprecedented attack on the world's energy supply." "Tehran is behind nearly 100 attacks on Saudi Arabia while [President Hassan] Rouhani and [Foreign Minister Mohammad] Zarif pretend to engage in diplomacy," Pompeo tweeted, referring to the nation's president and foreign affairs minister. There is no evidence the attacks came from Yemen." Iran-backed Houthi rebels in Yemen claimed responsibility for the attack hours before Pompeo's tweet. The world's largest oil processing facility in Saudi Arabia and a major oil field were impacted, sparking huge fires at a vulnerable chokepoint for global energy supplies. "The United States will work with our partners and allies to ensure that energy markets remain well supplied and Iran is held accountable for its aggression," Pompeo concluded. According to multiple news reports that cited unidentified sources, the drone attacks affected up to half of the supplies from the world's largest exporter of oil, though the output should be restored within days. It remained unclear if anyone was injured at the Abqaiq oil processing facility and the Khurais oil field. Sen. Chris Murphy, D-Conn., who sits on the Senate Foreign Relations Committee, denounced Pompeo's description of the attack, calling it an "irresponsible simplification." "The Saudis and Houthis are at war.